Knowledge Representation Issues in Information Extraction

Authors

  • Li Kwang Angela Wee
  • Loong Cheong Tong
  • Chew Lim Tan
Abstract

The advent of computing has exacerbated the problem of information overload. Advanced information management strategies such as Information Extraction, Information Filtering, Information Retrieval, and Text Categorization are becoming important for managing this deluge of information. Information Extraction (IE) systems can be used to automatically extract relevant information from free-form text, either to update databases or to generate reports. This paper describes the major knowledge representation challenges in an information extraction task: representing the meaning of the input text, the knowledge of the field of application (or application domain), and the knowledge about the target information to be extracted. In this research, we have chosen a directed graph structure to represent the meaning of the input text, a domain ontology to represent the application domain, and a frame representation to capture the target information to be extracted. We discuss in this paper how these knowledge structures interact to perform the task of information extraction.
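The abstract names three structures without giving their details, so the following is only a minimal Python sketch of how a directed graph, a domain ontology, and a target frame might work together. Every name in it (`ONTOLOGY`, `MeaningGraph`, `HIRING_FRAME`, `fill_frame`, the relation labels, and the sample sentence) is a hypothetical illustration and is not taken from the paper.

```python
from dataclasses import dataclass, field

# Hypothetical domain ontology: each concept maps to its parent (is-a link).
ONTOLOGY = {
    "company": "organization",
    "organization": "entity",
    "person": "entity",
}


def is_a(concept, ancestor):
    """Follow is-a links upward to test whether concept falls under ancestor."""
    while concept is not None:
        if concept == ancestor:
            return True
        concept = ONTOLOGY.get(concept)
    return False


@dataclass
class MeaningGraph:
    """Directed graph of text meaning: typed nodes and labeled edges."""
    node_types: dict = field(default_factory=dict)  # node name -> concept
    edges: list = field(default_factory=list)       # (source, relation, target)

    def add_node(self, name, concept):
        self.node_types[name] = concept

    def add_edge(self, source, relation, target):
        self.edges.append((source, relation, target))


# Hypothetical target frame: slot -> (edge relation, required ontology concept).
HIRING_FRAME = {
    "employer": ("agent", "organization"),
    "employee": ("patient", "person"),
}


def fill_frame(graph, frame):
    """Fill each slot with the first edge target matching relation and concept."""
    filled = {slot: None for slot in frame}
    for slot, (relation, concept) in frame.items():
        for _source, rel, target in graph.edges:
            if rel == relation and is_a(graph.node_types.get(target), concept):
                filled[slot] = target
                break
    return filled


# Assumed graph for the made-up sentence "Acme hired Jane Doe."
graph = MeaningGraph()
graph.add_node("hire1", "hire_event")
graph.add_node("Acme", "company")
graph.add_node("Jane Doe", "person")
graph.add_edge("hire1", "agent", "Acme")
graph.add_edge("hire1", "patient", "Jane Doe")

result = fill_frame(graph, HIRING_FRAME)
print(result)
```

In this sketch the ontology is what lets the `employer` slot accept a node typed `company`, since `company` is a kind of `organization`. The frame itself only states which relation and which concept each slot requires.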

Similar resources

A VICORE Architecture for Intelligent Knowledge Management

We consider the functionality, architecture, design and implementation issues related to the development of intelligent systems for knowledge management that assist people in finding relevant literature and in discovering new knowledge. We describe a visualized concept representation (VICORE) framework for knowledge management. Special attention is given to the use of concept association matric...

Knowledge Acquisition from Multimedia Content using an Evolution Framework

We propose an approach to knowledge acquisition, which uses multimedia ontologies for fused extraction of semantics from multiple modalities, and feeds back the extracted information, aiming to evolve knowledge representation. This paper presents the basic components of the proposed approach and discusses the open research issues focusing on the fused information extraction that will enable the...

Some empirical findings on dialogue management and domain ontologies in dialogue systems - Implications from an evaluation of BirdQuest

In this paper we present implications for the development of dialogue systems, based on an evaluation of the system BIRDQUEST, which combines dialogue interaction with information extraction. A number of issues detected during the evaluation, primarily concerning dialogue management and the representation and use of domain knowledge, are presented and discussed.

Hyperspectral Image Classification Based on the Fusion of the Features Generated by Sparse Representation Methods, Linear and Non-linear Transformations

The ability to record the high-resolution spectral signature of the earth's surface is arguably the most important feature of hyperspectral sensors. Classification of hyperspectral imagery, in turn, is one of the methods for extracting information from these remote sensing data sources. Despite the high potential of hyperspectral images from an information content point of view, there...

AeroDAML: Applying Information Extraction to Generate DAML Annotations from Web Pages

The DARPA Agent Markup Language (DAML) is an emerging knowledge representation for the Semantic Web. DAML can encode the semantics of a document for use by agents on the web. However, DAML annotation of documents and web pages is a tedious and time-consuming task. AeroDAML is a knowledge markup tool that applies natural language information extraction techniques to automatically generate DAML a...

Publication date: 1998